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ABSTRACT 

Alkaline exonuclease and single-strand DNA 
(ssDNA) annealing proteins (SSAPs) are key com- 
ponents of DNA recombination and repair systems 
within many prokaryotes, bacteriophages and 
virus-like genetic elements. The recently sequenced 
p-proteobacterium Laribacter hongkongensis (strain 
HLHK9) encodes putative homologs of alkaline 
exonuclease (LHK-Exo) and SSAP (LHK-Bet) 
proteins on its 3.17 Mb genome. Here, we report 
the biophysical, biochemical and structural charac- 
terization of recombinant LHK-Exo protein. 
LHK-Exo digests linear double-stranded DNA mol- 
ecules from their 5' -termini in a highly processive 
manner. Exonuclease activities are optimum at pH 
8.2 and essentially require Mg 2+ or Mn 2+ ions. 
5 -phosphorylated DNA substrates are preferred 
over dephosphorylated ones. The crystal structure 
of LHK-Exo was resolved to 1.9 A, revealing a 
'doughnut-shaped' toroidal trimeric arrangement 
with a central tapered channel, analogous to that 
of /.-exonuclease (Exo) from bacteriophage-).. 
Active sites containing two bound Mg 2+ ions on 
each of the three monomers were located in clefts 
exposed to this central channel. Crystal structures 
of LHK-Exo in complex with dAMP and ssDNA 
were determined to elucidate the structural basis 
for substrate recognition and binding. Through 



structure-guided mutational analysis, we discuss 
the roles played by various active site residues. 
A conserved two metal ion catalytic mechanism is 
proposed for this class of alkaline exonucleases. 

INTRODUCTION 

The accurate repair of broken or damaged double- 
stranded DNA (dsDNA) molecules by homologous 
recombination (HR) is a process common to all living 
organisms (1,2). The process of HR involves genetic 
exchange (crossover) between two dsDNA molecules via 
stretches of DNA that share significant sequence hom- 
ology. In prokaryotes, it plays pivotal roles in chromo- 
some repair, replication and segregation (3,4). HR also 
promotes horizontal gene transfer between prokaryotes 
and resident plasmids, bacteriophages or assorted mobile 
genetic elements (5). Consequently, many prokaryotes 
have acquired HR-promoting proteins of viral/phage 
origin in addition to their 'endogenous' DNA 
recombinational-repair machinery (6). 

Two-component HR-promoting proteins of viral/phage 
origin, comprising an alkaline exonuclease and a 
single-stranded DNA (ssDNA) annealing protein [SSAP, 
also referred to as a synaptase (7)] are widely distributed 
throughout the prokaryotic world (6,8-10). In these 
systems, the alkaline exonuclease protein fully digests 
one strand, or partially digests both strands of the 
linearized dsDNA substrate; generating ssDNA mol- 
ecules, or long ssDNA 'tails' several hundred nucleotides 
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in length. These are 'coated 1 with the partnering SSAP 
protein, which promotes their annealing with complemen- 
tary regions of ssDNA on the chromosome or episome 
(11-13). This most likely occurs at the replication fork 
(14-16). DNA repair and replication proteins process 
the genetic intermediates formed, resulting in the 
creation of fully double-stranded recombinant DNA mol- 
ecules (6,7,14). The best studied of the bacterial-based, 
two-component exonuclease and SSAP systems are the 
'Red' proteins Exo (A-exonuclease) and Bet from 
bacteriophage-! (7,17); RecE and RecT from the 
Escherichia coli Rac prophage (7,12,18,19); G34.1P 
and G35P from bacteriophage SPP1 (13,20,21); gp60 
and gp61 from Mycobacteriophage Che9c (22,23); and 
SXT-Exo (s066) and SXT-Bet (s065) from the SXT 
genetic element of Vibrio cholerae (24-26). Over recent 
years, various partnering pairs of exonuclease and SSAP 
proteins have been utilized in 'recombineering' procedures 
to create genetic alterations within a variety of prokaryot- 
ic organisms; including E. coli, Salmonella, Pseudomonas 
and Mycobacteria (22,27-29). In these systems, the exo- 
nuclease and SSAP proteins mediate HR-type events 
between PCR-generated dsDNA 'targeting cassettes' 
(encoding the desired genetic change) and the bacterial 
chromosome or resident episome (14). 

First isolated and characterized in the 1960s (30,31) 
(and references therein), the activities and biophysical 
properties of the A-exonuclease protein have been 
extensively studied. Phylogenetically related to exonucle- 
ases from the Baculoviridae and Herpesviridae families, 
it is the prototypical member of the alkaline exonuclease 
family (8), which belongs to the restriction endonuclease- 
like PD-(D/E)XK superfamily (32). Its activities are 
optimal at pH 9.2-9.5, and are dependent upon Mg 2+ 
cations; although Mn 2+ can substitute with a 3- to 4-fold 
reduction in catalytic efficiency (31). 1-exonuclease binds 
to the termini of linear dsDNA molecules, and 
processively digests the 5'-ended strand to produce long 
3'-ssDNA tails, with the concomitant liberation of 
5'-mononucleotides. It has previously been estimated 
that an average of 3000 (33) or 18 000 (34) nucleotides 
are digested without dissociation of the enzyme from the 
undigested strand. It has a strong preference for dsDNA 
molecules containing 5'-phosphorylated (5'-P0 4 ) ends, 
digesting them an estimated 2- to 100-fold more effectively 
than the corresponding dephosphorylated (5'-OH) 
substrates (35-37). Its average digestion rate has been 
estimated to be ~4nt/s (35), 9nt/s (38), 15-20nt/s (39), 
32nt/s (34) or MOOOnt/s (40), using a variety of bulk or 
single-molecule scale biophysical approaches. 

The crystal structure of A-exonuclease was reported 
in 1997 (41), revealing an unusual toroidal (doughnut- 
shaped) trimeric arrangement with a central tapered 
channel. This funnel-shaped channel had an inner 
diameter of ~30A at one end, which is wide enough to 
accommodate a dsDNA substrate. However, it was 
only ~ 1 5 A wide at the narrow end, which is only 
capable of accommodating ssDNA. Each monomer con- 
tained a Mn 2+ ion housed within a pocket lining the 
channel, which presumably equate to the active sites that 
normally accommodate catalytic Mg 2+ ions. This led the 



authors to propose a mechanism whereby the dsDNA sub- 
strate enters at the wide opening, and the non-hydrolyzed 
3'-strand exits at the opposite end. 

The crystal structure of the catalytically active 
C-terminal domain of the functionally equivalent RecE 
exonuclease was recently reported (38). RecE formed an 
analogous toroidal tetrameric arrangement, with a central 
tapered channel that contained the catalytic residues. 
Although A-exonuclease and RecE share little overall 
primary sequence similarity and have quite distinct 
multimeric arrangements, the central channels within 
these two proteins have a highly similar size and 
topology. This strongly suggests a conserved mode of 
action where the terminus of the dsDNA molecule enters 
at the wide end of the channel; the strands are separated, 
and the 5'-end is processively hydrolyzed as the protein 
encircles and translocates along the undigested 3'-strand 
(38,41). 

As part of our ongoing efforts to characterize prokary- 
otic DNA recombination proteins, we identified two 
adjacent genes within a suspected prophage region on 
the genome of Laribacter hongkongensis HLHK9, which 
encode homologs of alkaline exonuclease (LHK-Exo, 
LHK_01497) and SSAP (LHK-Bet, LHK_01496) 
proteins (42). Laribacter hongkongensis is a recently 
identified species of (3-proteobacteria that is a potential 
diarrheal pathogen (43,44). Here, we describe the func- 
tional and structural characterization of LHK-Exo, 
which shares 29% primary amino acid sequence identity 
with A-exonuclease. The crystal structure of LHK-Exo 
was solved to 1.9 A resolution, revealing a 'doughnut-like' 
toroidal structure, similar to that previously observed for 
1-exonuclease (41). LHK-Exo functioned as a DNA exo- 
nuclease, digesting linear dsDNA with a clear preference 
for 5'-phosphorylated ends. Under the guidance of the 
structure of LHK-Exo in complex with DNA, a set of 
active site point mutants were constructed and function- 
ally characterized to probe sequence-structure relation- 
ships. We compare the activities and properties of 
LHK-Exo with those of related nucleases; and discuss 
mechanisms for substrate recognition and binding, as 
well as a potential catalytic mechanism for this class of 
alkaline exonucleases. 



MATERIALS AND METHODS 

Materials 

All buffer salts, metal salts, chemicals and biochemicals 
were purchased from Sigma-Aldrich or GE Biosciences. 
PicoGreen reagent was from Invitrogen (cat.# P7589). 
Unmodified and phosphorothioate modified oligonucleo- 
tides were purchased from TechDragon Ltd. (Shatin, 
Hong Kong). Where applicable, oligonucleotides were 
5'-phosphorylated using T4 polynucleotide kinase 
(NEB), according to the manufacturer's instructions, 
then desalted on Auto Seq G50 columns (GE 
Biosciences). All restriction enzymes were purchased 
from New England Biolabs (NEB; Ipswich, MA, USA). 
pET28a and pET32a plasmid DNA were from Novagen; 
pUC18 was from Stratagene and pMAL-c2 was purchased 
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from New England Biolabs. Restricted (linearized) 
dsDNA was routinely purified using a Qiaquick Gel 
Extraction kit (Qiagen GmbH). When applicable, 
linearized dsDNA was 5'-dephosphorylated using calf in- 
testinal phosphatase (NEB) then gel purified. dsDNA was 
resolved by electrophoresis on agarose/Tris-acetate 
EDTA (TAE) gels and stained with ethidium bromide 
(Sigma). ssDNA was analyzed by denaturing polyacryl- 
amide gel electrophoresis on 7 M urea-TBE polyacryl- 
amide (12%, 37.5:1) gels, which were stained using 
SYBR Gold Nucleic Acid Gel Stain (Invitrogen). Gels 
were visualized by UV-transillumination on a ChemiDoc 
XRS molecular imaging system equipped with Quantity 
One v4.6.6 software (BioRad). A 1 kb Plus DNA Ladder 
(Invitrogen) was used as a marker for dsDNA, and Oligo 
Length Standards 20/100 Ladder (Integrated DNA 
Technologies Inc., USA) was used for ssDNA gels. 
Graphs were prepared using SigmaPlot 10.0 (Systat 
Software, Inc). 

Methods 

Cloning, expression and purification of LHK-Exo. The 
LHK-exo gene (LHK_01497; GenBank: NC.012559.1, 
1455161. .1455772) was PCR amplified from Laribacter 
hongkongensis HLHK9 genomic DNA (Expand 
Polymerase, Roche) using the LHKexoforl (ATAT CCA 
TGG AACAACGCAC) and LHKexorev2 (TTAA AAGC 
TTGGCGGCCTTTCGTTTCAG) primers. The amplified 
gene product was ligated via Ncol/Hindlll (restriction 
sites underlined) into pET28a (Novagen) to create 
plasmid pEXNHl, which encodes a 13aa C-terminal 
hexa-histidine fusion (. . .KLAAALEHHHHHH*). 
Escherichia coli BL21 (DE3) containing pEXNHl was 
grown in Luria-Bertani (LB) medium (2 1) containing 
50ug/ml kanamycin (37°C), to an OD 600 of 0.6-0.8, 
induced with IPTG (isopropyl-fi-D-thiogalactopyranoside, 
0.2 mM), then incubated at 25°C for 10 h. Bacterial cells 
were harvested by centrifugation (6000g, 30min), washed 
with PBS buffer, then resuspended in PBS buffer contain- 
ing protease inhibitors (Complete, Roche) and lysed by 
sonication (ice cooling). The supernatant obtained after 
centrifugation (15 000 g, 30min) was filtered (0.45 um 
syringe filter, Millipore) then applied to a Ni 2+ -charged 
5 ml Hitrap Chelating FF column (GE Healthcare) 
pre-equilibrated with PBS buffer. The column was 
washed (PBS buffer containing lOmM imidazole, 20 ml) 
and the protein was eluted from the column using a linear 
gradient of imidazole (25-500 mM) in PBS buffer. 
Fractions containing the purest protein were pooled and 
concentrated (Amicon Ultra, Millipore) and buffer was 
exchanged using a G-25 Sephadex desalting column (GE 
Healthcare) pre-equilibrated with buffer A (20 mM Tris- 
HC1 pH 8.0, 1 mM DTT). Protein was then purified by 
anion exchange chromatography (Resource Q, GE 
Healthcare), eluting with a linear gradient of 0-1 M 
NaCl in buffer A. Fractions containing the purest recom- 
binant LHK-Exo protein were pooled then subjected to 
gel filtration chromatography (Superdex-200 HR, GE 
Healthcare) using buffer B (20 mM Tris pH 8.0, 150mM 
NaCl, 1 mM DTT). The purity of resultant LHK-Exo was 



greater than ~98% as determined by SDS-PAGE 
analysis. Single residue mutants of LHK-Exo were con- 
structed using the QuikChange Site-Directed Mutageneis 
kit (Stratagene) according to the manufacturer's instruc- 
tions; using plasmid pEXNHl as the template and the sets 
of oligonucleotides listed in Supplementary Table SI. 
Mutant proteins were expressed and purified as described 
above. 

Qualitative determination of LHK-Exo activities 
Mode of digestion. BamHI-linearized pET28a (1.8 ug, 
0.54 pmol) was incubated with LHK-Exo protein (30 ug, 
0.41 nmol of trimers) in 200 ul of Tris-HCl (pH 8.0, 
50 mM), 50 mM NaCl, 7.5 mM MgCl 2 at 37°C. Aliquots 
(20 ul) were withdrawn at various time points (indicated 
in the text) and analyzed on a 1% agarose/TAE gel. 

Polarity of digestion. The 712-bp dsDNA substrates 
('unmodified') were synthesized by PCR using pET32a 
as template, with two different pairs of primers, one 
pair of T7 (TAATACGACTCACTATAGGG) and 
T7rev (GCTAGTTATTGCTCAGCGG) for amplifica- 
tion of 'unmodified' substrate, and another pair of the 
phosphorothioate modified primers T7forPT3 (TAATA 
CGACTCACTATA S G S G S G) and T7revPT3 (GCTAGT 
TATTGCTCAG S C S G S G) for 'PT-modified' substrate 
(where subscript 'S' denotes a nuclease-resistant phos- 
phorothioate linkage between 2nt). Both PCR-amplified 
substrates were 5'-phosphorylated (T4 polynucleotide 
kinase, NEB) and purified (QIAquick PCR purification 
kit, QIAgen) prior to use. The two dsDNA substrates 
(0.1 ug, 0.23 pmol) were separately incubated with 6ug 
of lambda-Exo (74 pmol of trimers) or LHK-Exo 
(82 pmol of trimers) in Tris-HCl (25 mM, pH 8.0), 
7.5 mM MgCl 2 , 1 mM DTT (total volume 40 ul) at 37°C. 
Aliquots (20 ul) were quenched (20 mM EDTA+ 1% SDS) 
immediately and after 20min, then analyzed on 1% 
agarose TAE gels. 

Single strand DNA exonuclease activities. Phosphorylated 
[5'-PO 4 -(dT) 50 ] or non-phosphorylated [5'-OH-(dT) 50 ] 
forms of a 50-mer or oligothymidine (0.4 nmol) were 
incubated with LHK-Exo protein (4.5 ug, 61. 4 pmol of 
trimers) in 25 mM Tris-HCl, pH 8.0; 7.5 mM MgCl 2 ; 
ImM DTT at 37°C; total volume 80 ul. Aliquots (20 ul) 
were removed at 0, 0.5, 5 and 20min, and immediately 
quenched (30% glycerol, 7 M urea-TBE; 80 ul), prior to 
analysis by denaturing polyacrylamide gel electrophoresis. 

Quantitative determination of DNA exonuclease 
activities. Quenched PicoGreen DNA fluorescence 
assays were performed essentially as described previously 
(21,26,45) with minor modifications. Typical assays (60 ul) 
contained LHK-Exo protein (8ng, 0.11 pmol of trimers) 
and 30 ng linear pUC18 DNA (0.018 pmol) or 
EcoRV-linearized pMAL-c2 (0.0073 pmol) in nuclease 
assay buffer [Tris-HCl (25 mM, pH 8.0), 7.5 mM MgCl 2 , 
1 mM DTT]. Assays were initiated by the addition of 
linear dsDNA substrate, incubated at 25°C for 20min 
before being quenched by the addition of EDTA to 
20 mM. The 250-fold diluted PicoGreen reagent 
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(Invitrogen) was added to quenched reaction mixtures and 
fluorescence levels (which are proportional to the 
quantities of dsDNA present) were measured at an exci- 
tation 485 nm/emission 535 nm (Perkin Elmer 1420 
multi-label counter). 

Determination of optimal metal ion concentrations , pH and 
temperature for DNA e.xonuclease activities. Quenched 
PicoGreen fluorescence assays were performed, quenched 
and analyzed as described above with minor 
modifications. 

Determination of optimal Mg 2+ and Mn 2+ ion 
concentrations. Assays (60 ul) contained LHK-Exo 
protein (8ng, 0.11 pmol of trimers), Pstl-linearized 
pUC18 (30 ng, 0.018 pmol) in Tris-HCl (25 mM, pH 
8.0), MgCl 2 (1-15 mM) or MnCl 2 (0.1-1.7 mM) at the 
concentrations indicated in the text; and were incubated 
at 25° C for 20min. 

Determination of optimal pH. Assays (60 ul) contained 
LHK-Exo protein (8ng, 0.11 pmol of trimers), 
Pstl-linearized pUC18 (30 ng, 0.018 pmol), 25 mM Tris- 
HCl, 7.5 mM MgCl 2 ; adjusted to the appropriate pH 
value (pH 7.0-9.0); and were incubated at 25°C for 20min. 

Determination of optimal temperature. Assays (60 ul) con- 
tained LHK-Exo protein (40 ng, 0.55 pmol of trimers) and 
Pstl-linearized pUC18 (30 ng, 0.018 pmol) in nuclease 
assay buffer. All solutions were pre-equilibrated at the 
temperature indicated in the text (34-54° C) using a 
thermostat-regulated water bath; an incubation time of 
1 min was used. Four to six independent replicates of 
each experimental condition were performed. Data were 
fitted to hyperbolae using SigmaPlot 10.0 to calculate 
the k caX during the initial phase or reaction (digestion 
rate stated in nucleotides per second; per trimer of 
LHK-Exo protein), reported as the mean ± standard 
deviation. 

Determination of digestion processivity. LHK-Exo (6 ug, 
82 pmol of trimers) protein and 60 ng (0.015 pmol) 
of linear dsDNA substrate (5'-P0 4 -dsDNA, 
EcoRV-linearized pMal-c2; or 5'-OH-dsDNA, 5'- 
dephosphorylated EcoRV-linearized pMal-c2) were 
equilibrated in 180 ill of Tris-HCl (25 mM, pH 8.0), 
1 mM DTT, at 25°C for 5 min. A total of 90 ul of Tris- 
HCl (25 mM, pH 8.0), 1 mM DTT 15mM MgCl 2 was 
added to initiate the reaction; then exactly 1 5 s later, 
excess heparin (Sigma; 360 ug, 20 nmol) in 90 ul of Tris- 
HCl (25 mM, pH 8.0), ImM DTT, 7.5 mM MgCl 2 was 
added to sequester unbound LHK-Exo (total reaction 
volume 360 ul). Assays were incubated at 25°C for 
20 min, removing 60 ul aliquots at the times indicated in 
the text to determine dsDNA levels via fluorescent 
PicoGreen assays (as described above). Four to six inde- 
pendent replicates of each experimental condition were 
performed. Plotted data were fitted to hyperbolae using 
SigmaPlot 10.0 to obtain the digestion processivity from 
the extrapolated curves as previously described (26,36), 
which is reported as the mean ± standard deviation. 



Analysis of multimericity by analytical ultracentrifugation . 
Sedimentation velocity analytical ultracentrifugation 
experiments were performed using a Beckman 
Proteomelab™ XL-1 analytical ultracentrifuge 
(60000rpm, 262,000 g, 293 K). LHK-Exo protein was 
prepared at a concentration of 1 mg/ml in 1 50 mM 
NaCl. Data were analyzed using the c(S) and c(M) 
models in the Sedfit program (46) to determine the sedi- 
mentation coefficient and apparent mass distributions, 
respectively. 

Crystallization and X-ray data collection. The purified re- 
combinant LHK-Exo protein was concentrated to 20 mg/ 
ml in crystallization buffer (20 mM Tris-HCl, pH 8.0). 
Crystallization conditions were screened with the 
Hampton Research Screens kits. Crystallization was per- 
formed by the sitting-drop vapor-diffusion method at 
293 K, with 1 ul protein solution mixed with 1 ul reservoir 
solution and equilibrated with 100 ul reservoir solution 
containing 0.1 M Tris-HCl pH 8.4 and 1.0 M potassium 
sodium tartrate tetrahydrate. Good quality crystals were 
obtained within 2 days, and were flash cooled and stored 
in liquid nitrogen for future data collection. The X-ray 
diffraction data were collected to 1.9 A resolution at 
100 K at beamline BL5A of the Photon Factory (Japan) 
using an ADSC-Q315 detector. Reservoir solution con- 
taining 20% glycerol was used as cryoprotectant. The dif- 
fraction data were processed with the HKL2000 program 
suite (47). The LHK-Exo crystals obtained belong to 
space group P63 with unit cell parameters of: 
a = b = 108.9 A, c = 47.6A, a = (3 = 90° and y = 120°. 

To obtain substrate-bound complex structures, native 
LHK-Exo crystals were soaked at 293 K for 24 h in a 
solution containing dAMP or a 5'-phosphorylated 
pentamer of oligothymidine [5'-P0 4 -(dT) 5 ] at a final con- 
centration of 1 mM in 0.1 M Tris-HCl pH 8.4 and 1.0 M 
potassium sodium tartrate tetrahydrate. Immediately 
prior to data collection, crystals were soaked in cryopro- 
tectant solution consisting of 20% glycerol and the reser- 
voir solution. X-ray diffraction data were collected 
in-house at 100 K on a Rigaku R-AXIS IV ++ image 
plate using Cu Ka radiation (X = 1.5418 A) from a 
Rigaku MicroMax-007 rotating anode X-ray generator 
operating at 40 kV and 20 mA. All diffraction data were 
integrated, scaled and merged using the HKL2000 suite 

(47) . A complete summary of the data collection statistics 
is given in Table 1. 

Structure determination and refinement. The structure of 
LHK-Exo was solved by molecular replacement (MR) 
with the monomeric A-exonuclease (PDB code: 1AVQ) 
employed as a search model, using the Phaser program 

(48) in the CCP4 suite (49). Good quality electron 
density maps enabled initial model building by ARP/ 
wARP (50). Further model building and manual adjust- 
ments were facilitated using the Coot program (51) and 
structure refinement was performed by Refmac5 (52). The 
stereochemical quality of the structure model was moni- 
tored with the PROCHECK program (53). 

The substrate-bound complex structures were also 
determined by MR with the native enzyme employed as 
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Table 1. X-ray data collection and structure refinement statistics 
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a search model. The dAMP and DNA molecules were 
manually built and adjusted under the guidance of 
F 0 — F c difference maps using the Coot program (51). 
Structural refinement was also carried out using 
Refmac5 (52). A summary of the structure refinement stat- 
istics is provided in Table 1. 

RESULTS AND DISCUSSION 

LHK-Exo is an alkaline exonuclease that digests 
dsDNA with strict 5 - to 3' -polarity 

The LHK_01497 gene from the HLHK9 type strain of 
L. hongkongensis was cloned into the pET28a vector as 
a C-terminal hexahistidine fusion, and subsequently ex- 
pressed to high levels in E. coli. Three-step purification 
by immobilized Ni 2+ ion affinity, anion exchange and 
gel-filtration chromatography afforded multi-milligram 
quantities of LHK-Exo protein of high purity suitable 
for biochemical and structural characterization. 
Analogous to the phylogentically related 1-exonuclease 
and SPPl-Chu (G34.1P) proteins, recombinant 
LHK-Exo strictly required Mg 2+ or Mn 2+ ions for the 
hydrolysis of linear dsDNA molecules (Supplementary 
Figure SI) (20,21,31). Analysis of quenched aliquots 
from a time-course incubation of LHK-Exo with linear 
dsDNA by agarose gel electrophoresis, indicated that 
LHK-Exo was functioning as an exonuclease that did 
not release significant amounts of partially digested 
products (Figure 1A). This is consistent with the activities 
of A-exonuclease (30,54), SPPl-Chu (20,21), RecE (55) 
and SXT-Exo (26), that all function as highly processive 
dsDNA exonucleases. 

Modified dsDNA substrates (7 12 bp in length), con- 
taining three consecutive 'nuclease resistant' 



phosphorothioate linkages near the 5'-termini of both 
strands (PT modified) were synthesized using a 
PCR-based strategy, to probe the polarity of 
exonucleolytic digestion (Figure IB). Analogous dsDNA 
substrates with no backbone modifications (unmodified) 
were similarly synthesized as controls. By substituting 
phosphodiester linkages with phosphorothioate linkages 
near the 5'-termini, the activities of a 5'- to 3'-exonuclease 
should be severely inhibited, whereas the activities of a 3'- 
to 5'-exonuclease should be relatively unaffected. We have 
previously used this approach to probe the in vivo DNA 
hydrolytic activities of the SXT-Exo protein (26). 
A-exonuclease was included as a control, as this enzyme 
has previously been shown to hydrolyze DNA in the 5'- to 
3'-direction. LHK-Exo degraded negligible amounts of the 
(5'-phosphorylated) 'PT-modified' dsDNA substrate 
(Figure IB, lane 5), but fully digested the analogous 'un- 
modified' dsDNA substrate under the conditions tested 
(lane 3). 1-exonuclease exhibited analogous activities 
(lanes 6-9). These assays clearly demonstrated that 
LHK-Exo specifically digests dsDNA from the 
5'-termini; i.e. it hydrolyzes dsDNA with strict 5'- to 
3'-polarity. 

We probed the ssDNA hydrolysis activities of LHK- 
Exo using 5'-phosphorylated and non-phosphorylated 
forms of a 50-mer of oligothymidine [5'-PO 4 -(dT)5 0 and 
5'-OH-(dT) 5 o, respectively] as model substrates. LHK-Exo 
effectively hydrolyzed both 5'-phosphorylated and 
non-phosphorylated ssDNA substrates (Figure 1C and 
D); although the 5'-phosphorylated oligonucleotide was di- 
gested ~5- to 10-fold more rapidly. As was the case for 
dsDNA substrates, Mg 2+ ions were essentially required 
for activity (Supplementary Figure S2). Notably, the hy- 
drolysis of 5'-OH-(dT) 50 was strongly inhibited at high salt 
concentrations (1 M KC1), while the degradation of the 
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Figure 1. Qualitative analysis of ssDNA and dsDNA hydrolysis activities of LHK-Exo. (A) dsDNA exonuclease activities. Agarose gel showing 
aliquots taken (0-15min) from an incubation of LHK-Exo (30 ug, 0.41 nmol of trimers) and BamHI-linearized pET28a (1.8 (ag, 0.54 pmol) in Tris- 
HC1 (pH 8.0, 50 mM), 50 mM NaCl, 7.5 mM MgCl 2 at 37°C. (B) Polarity of dsDNA digestion. A total of 6ug of LHK-Exo (82 pmol of trimers, lanes 
2-5) or A. -exonuclease (74 pmol of trimers, lanes 6-9) protein was incubated with 0.1 ug (0.23 pmol) of a 712-bp linear 5'-phosphorylated dsDNA 
substrate ('unmodified'; lanes 2, 3, 6 and 7), or an analogous 5'-phosphorylated linear dsDNA substrate containing three consecutive 
'nuclease-resistant' phosphorothioate linkages at its 5'-termini ('PT-modified'; lanes 4, 5, 8, 9). Assays were quenched immediately (Omin) or 
incubated at 37°C for 20 min, before analysis of digestion products on 1% agarose gels. (C) Digestion of 5'-phosphorylated ssDNA. Reaction 
mixtures (80 ul) containing LHK-Exo (4.5 ug, 61.4pmol of trimers) and 5'-PO 4 -(dT) 50 (0.4 nmol) in 25 mM Tris-HCl (pH 8.0), 7.5mM MgCl 2 , 1 mM 
DTT were incubated at 37°C. Aliquots (20 ul) were withdrawn after 0, 0.5, 5 and 20 min, and immediately quenched. Reaction products were 
analyzed by denaturing gel electrophoresis. (D) Digestion of non-phosphorylated ssDNA. Analogous sets of assays were performed using 
non-phosphorylated 50-mers of oligothymidine [5'-OH-(dT) 50 ]. Fluorescent gel images were scanned after SYBR Gold staining. A ssDNA ladder 
[Oligo Length Standards 20/100 Ladder (IDT)] is included in lane 1. 



5'-phosphorylated form was less affected (Supplementary 
Figure S2). This is consistent with 5'-phosphorylated 
DNA molecules binding to the LHK-Exo protein more 
effectively than the corresponding non-phosphorylated 
ones, and consequently being preferred as substrates (see 
below), as is the case for the A -exonuclease, SXT-Exo and 
SPPl-Chu proteins (20,21,26,36). 

The dsDNA exonuclease activities of LHK-Exo were 
investigated in a quantitative manner using the sensitive 
PicoGreen fluorescent DNA stain, as has previously been 
used in analogous studies (21,36,45). As may be seen in 
Figure 2, LHK-Exo activities were optimal in the presence 
of Mg 2+ ions at a concentration of 7.5 mM (Figure 2A) or 



Mn + ions at a concentration of 1 mM (Figure 2B). This is 
roughly similar to concentrations of Mg 2+ ions utilized by 
related exonucleases of viral/phage origin: 1-exonuclease 
[1-2.5 mM (31)]; BGLF5 from the Epstein-Barr virus 
[EBV; 20 mM (56); 5mM (57)]; baculovirus alkaline exo- 
nuclease [4-8 mM (58)]; UL12 from Herpes simplex virus 
1 [2-5 mM (59)]; V-TREX from Choristoneura fumiferana 
nucleopolyhedro virus [5mM (60)]. Equivalent levels of 
dsDNA digestion were observed (within experimental 
error) when Mg 2+ or Mn 2+ ions were present at their 
optimal concentrations; suggesting that LHK-Exo can ac- 
commodate and utilize either ion equally well. This is 
notably different to the metal ion selectivities of related 
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Figure 2. Determination of optimum conditions for LHK-Exo dsDNA digestion activities. PicoGreen fluorescence assays were performed to 
quantify the amounts of a representative 5'-phosphorylatd dsDNA substrate (Pstl-linearized pUC18 that were digested by LHK-Exo under 
various conditions (reported as a percentage of the initial quantities of DNA). (A and B) Optimal concentrations of Mg 2+ and Mn 2+ ions. 
Amounts of Pstl-linearized pUC18 (30ng, 0.018pmol) digested by LHK-Exo (8ng, 0.11 pmol of trimers) in Tris-HCl (25 mM, pH 8.0) containing 
varying concentrations of Mg ions (1-15 mM, A) or Mn 2+ ions (0.1-1. 7 mM, B), respectively, upon incubation at 25°C for 20min. (C) Optimal pH. 
Amounts of Pstl-linearized pUC18 (30ng, 0.018pmol) digested by LHK-Exo (8ng, 0.11 pmol of trimers) in 25 mM Tris-HCl, 7.5mM MgCl 2 ; 
adjusted to the appropriate pH value (pH 7.0-9.0); upon incubation at 25°C for 20min. (D) Optimal temperature. Amounts of Pstl-linearized 
pUC18 (30ng, 0.018pmol) digested by LHK-Exo (40ng, 0.55pmol of trimers) in Tris-HCl (25 mM, pH 8.0), 7.5mM MgCl 2 , after incubation for 
1 min at the indicated temperature (34-54°C). Four to six independent replicates of each experimental condition were performed, and data are 
reported as the mean ± standard deviation. 



exonucleases; where activities have been reported to be 
~4- to 10-fold higher in the presence of optimal concen- 
trations of Mg 2+ ions, compared with optimal Mn 2+ ion 
concentrations (31,57-59). dsDNA degradation levels 
were highest at pH 8.2 (Figure 2C), and at a temperature 
of 41°C (Figure 2D). The optimal pH value for LHK-Exo 
is comparable to those determined for related exonucle- 
ases: A-exonuclease [pH 9.2-9.5 (31)]; SPPl-Chu [G34.1P; 
pH 9.2-9.5 (21)]; SXT-Exo [pH 8.2 (26)]; BGLF5 [pH 8.0 
(56); pH 8.5-9.5 (57)]; baculovirus alkaline exonuclease 
[pH >9 10 (58)]; UL12 [pH 9-10 (59)]; V-TREX [pH 
9.5 (60)]. When the pH and Mg 2+ ion concentrations 
were optimal, LHK-Exo hydrolyzed 5'-phosphorylated 
linear dsDNA at a rate of 7.0 ± 1.8nt/s (per LHK-Exo 
monomer, per DNA end; at 25°C; Table 2). This rate of 
hydrolysis is comparable to that previously determined for 
X-exonuclease under a variety conditions (34,35,38,39). 

LHK-Exo digests 5'-phosphorylated dsDNA in a highly 
processive manner 

The digestion processivity of LHK-Exo was determined 
by a 'heparin trap' method (61), which has similarly 
been used to characterize the X-exonculease (36), 
SPPl-Chu (21) and SXT-Exo (26) alkaline exonucleases. 
In this approach, a large excess of the heteropoly- 
saccharide 'DNA-mimic' heparin is added to assays con- 
taining exonuclease and dsDNA substrate shortly after 



Table 2. Rate and processivity of double strand DNA digestion 
mediated by wild-type LHK-Exo and mutants 



Proteins 


5'-PC 


4 dsDNA 


5'-OH dsDNA 


(nt/s) 


processivity (nt) 


^cat (nt/ s ) processivity (nt) 


WT 


7.0 ± 1.8 


2368 ± 72 


1.6 ± 0.5 805 ± 56 


R12A 


1.0 ± 0.3 


955 ± 38 


1.1 ± 0.3 567 ±39 


E68A 


2.1 ± 0.5 


720 ± 32 




K114A 


1.0 ± 0.3 


655 ± 45 




Y134A 


1.3 ± 0.4 


812 ± 47 




Y134F 


4.0 ± 0.8 


1140 ± 59 




Q137A 


1.8 ± 0.6 


1122 ± 114 




Q137E 


0.3 ± 0.1 


440 ± 52 





Analogous sets of quenched PicoGreen fluorescence assays were 
performed to quantify the respective rates and processivities with 
which representative 5'-phosphorylated dsDNA (5'-P0 4 -dsDNA) or 
5'-dephosphorylated dsDNA (5'-OH-dsDNA) substrates were digested 
by wild-type LHK-Exo or the specified active-site point-mutant under 
standardized conditions. See 'Materials and Methods' section for 
details. 



their initiation. This sequesters all unbound enzyme, and 
also prevents any disassociated enzyme from re-binding to 
initiate additional digestion events. DNA levels are subse- 
quently quantified using the fluorescent PicoGreen 
reagent, to determine the average number of nucleotides 
digested during a single DNA-binding event. Under 



9810 Nucleic Acids Research, 2011, Vol. 39, No. 22 



-m- WT + 5'-P0 4 -dsDNA 
-O- WT + 5'-0H-dsDNA 




n I I 1 r 

0 5 10 15 2( 

Time (min) 



Figure 3. Processivity of double strand DNA digestion by wild-type 
LHK-Exo and Argl2Ala mutant. Time course analysis of the digestion 
of 5'-phosphorylated double strand DNA (5'-P0 4 -dsDNA: EcoRV- 
linearized pMal-c2) and 5'-dephosphorylated double strand DNA 
(5'-OH-dsDNA: 5'dephosphorylated EcoRV-linearized pMal-c2) sub- 
strates by wild-type LHK-Exo and the Argl2Ala mutant form using 
a 'heparin trap' approach. A total of 6ug (82pmol of trimers) of 
LHK-Exo or Argl2Ala mutant protein was incubated at 25°C with 
60 ng (0.015 pmol) of 5'-P0 4 -dsDNA or 5'-OH-dsDNA in Tris-HCl 
(25mM, pH 8.0), 1 mM DTT, 7.5mM MgCl 2 . After 15s, excess 
heparin was added to sequester all unbound protein, and to prevent 
disassociated protein from re-binding. Aliquots were removed at 
various time points (0-20 min), and dsDNA levels were determined 
using fluorescent PicoGreen assays, to enable the extent of DNA di- 
gestion to be calculated. In one set of assays, heparin was added to 
LHK-Exo prior to the addition of dsDNA substrate, to confirm the 
efficacy of the heparin trap method (filled black squares, green line). 
Graphs show the mean number of nucleotides digested from each 
terminus (±SD; j'-axis) plotted against the time of analysis (in 
minutes; .Y-axis). 



optimal conditions (pH 8.0, 7.5 mM MgCl 2 ) at 25°C, 
LHK-Exo digested 2368 ± 72 nt from each terminus of 
5'-phosphorylated, blunt-ended dsDNA molecules 
(5'-P0 4 -dsDNA; EcoRV-linearized pMal-c2; 6646 bp in 
length; Table 2 and Figure 3). To determine whether pH 
had a strong influence on the digestion processivity, analo- 
gous sets of assays were performed at pH 6.8, 7.4 and 8.6 
(Supplementary Table S2). Results clearly indicated that 
processivity was almost entirely unaffected by changes in 
pH over this range. This contrasts markedly with the sig- 
nificant effect that pH has on the rate of dsDNA digestion 
(Figure 2); which is 40-60% lower at these suboptimal pH 
values. 

As the phosphorylation status of the 5'-termini has 
previously been shown to greatly affect both the rate 
and processivity of dsDNA digestion mediated 
by the SPPl-Chu and X-exonuclease proteins (20,21,36), 
we performed analogous sets of assays using 
5'-dephosphorylated EcoRV-linearized pMal-c2 as the 
dsDNA substrate (5'-OH-dsDNA; Table 2 and 
Figure 3). Results revealed that LHK-Exo trimers 
digested an average of 805 ± 56 nt from each 
5'-dephosphorylated termini, which is two-thirds lower 
than the corresponding value for 5'-dephosphorylated 
dsDNA substrates. Similarly, LHK-Exo digested these 
5'-dephosphorylated dsDNA substrates 5 times lower 



than the phosphorylated ones (1.6 ± 0.5 and 
7.0 ± 1.8nt/s, respectively). Taken together, these results 
reveal that the biochemical activities of LHK-Exo are 
analogous to those of phylogenetically related alkaline 
exonucleases; indicating there is high conservation of 
function within this protein family. 

Overall structure of LHK-Exo 

The purified recombinant LHK-Exo protein was 
crystallized using the sitting-drop vappr-diffusion 
method. Its crystal structure was solved to 1.9 A resolution 
by molecular replacement using the structure of 
1-exonuclease (PDB code 1AVQ) as a search model, with 
the crystallographic i? wor k and R{ ree converging to 20.4% 
and 24.1%, respectively (Table 1). The final structure 
includes one LHK-Exo monomer and 177 water molecules. 
LHK-Exo presents in the asymmetric unit as a monomer 
with residues 5-203 traced well in the model. Three expres- 
sion vector-encoded residues located at the C-terminus of 
the protein were also traced in the structure. The four 
N-terminal residues, as well as residues 28-32 and 55-57 
were unresolved, indicating that these regions may exist as 
flexible regions (discussed below). 

The overall structure of LHK-Exo adopts an a/ (3 
'pistol-like 1 fold including seven a-helices and four 
P-strands (Figure 4A). In addition to X-exonuclease, a 
DALI search identified another four related structures, 
including: a phage-related exonuclease from Haemophilus 
somnus 129PT (HSJ420; PDB code 3K93; Z-score 17.6; 
r.m.s.d. 2.4 A for 175 residues); the BGLF5 nuclease from 
Epstein-Barr virus (PDB code 2W4B; Z-score 11.4; r.m.s.d 
2.5 A for 168 residues) (62); the Shut-off and Exonuclease 
(SOX) protein from Kaposi's sarcoma-associated herpes- 
virus (PDB code 3FHD; Z-score 10.8; r.m.s.d. 2.4A for 
160 residues) (63) and the E. coli RecE protein (PDB code 
3H4R, Z-score 10.0, r.m.s.d. 3.4A for 152 residues) (38). 
The LHK-Exo, HSJ420, SOX and BGLF5 proteins 
belong to the X-exonuclease family; while RecE shares 
more similarity with the RecB family (8). All possess 
DNAse and/or RNAse activities, with the exception of 
the HS_1420 protein, which has not yet been functionally 
characterized. This suggests that a conserved catalytic core 
domain exists in this family, despite members sharing 
fairly low levels of sequence identity and possessing 
quite different molecular weights. Superimposing the 
structures of these nucleases with that of LHK-Exo 
enabled the conserved nuclease 'core domain' to be 
clearly identified. This comprised a central arrangement 
of p-strands surrounded by a-helices (Supplementary 
Figure S3, panels A-F). Insertions within this central 
domain, as well as N-terminal and C-terminal extensions, 
exhibited significant diversity in shape and size. 

LHK-Exo forms a toroidal trimer 

The five related nuclease structures mentioned above have 
diverse oligomeric states: being monomeric (BGLF5 and 
SOX), trimeric (A-exonuclease and HS_1420) and tetra- 
meric (RecE). Consequently, analytical ultracentrifuga- 
tion was performed to determine the oligomeric state of 
LHK-Exo. The sedimentation velocity showed a single 
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Figure 4. Crystal structure of LHK-Exo and the toroidal structure of the LHK-Exo homotrimer. (A) Ribbon diagram of the structure of the 
LHK-Exo monomer. The ot-helix, fi-sheet and loop regions are colored salmon-pink, light blue and wheat-yellow, respectively. The two bound Mg 2+ 
ions are shown as gray spheres. (B) Toroidal structure of the LHK-Exo homotrimer, which has a 3-fold axis of symmetry. The three constituent 
monomers are colored cyan, magenta and green. (C) Electrostatic surface potential of the 'doughnut-like' LHK-Exo trimer, with positively charged 
regions colored blue and negatively charged regions in red. Residues contributing to the positively charged regions are labeled (on one protein 
monomer). It should be noted that residues A33 and N58 located at the C-terminal ends of the two disordered loops, respectively, over-represent the 
amount of the positive charge in the central channel due to their free amino groups. The toroidal LHK-Exo structure is shown with the wide-end of 
the tapered central channel facing the front (same orientation as in B). (D) Interactions at the interface between two neighboring LHK-Exo protein 
monomers, which are colored green and cyan. The residues involved in the interactions are shown with stick representation, and the water molecules 
located at the subunit interface are shown as red spheres. Salt bridges and hydrogen bonds are indicated with dashed lines. 



4.4 S species corresponding to a mass of 79 ± 4kDa 
(Supplementary Figure S4). The calculated mass for a 
trimer of LHK-Exo proteins is ~74 kDa, indicating that 
LHK-Exo forms a stable trimer in solution, analogous to 
1-exonuclease. Although LHK-Exo presents in the asym- 
metric unit as a monomer, the trimeric conformation 
can be identified via a symmetric operation involving 
two neighboring molecules. Forming a 'doughnut-like' 
toroidal structure (Figure 4B), the constructed conform- 
ation of the LHK-Exo trimer is highly similar to that 
previously obtained for A-exonuclease (41). A total 
solvent-accessible surface area of 992 A 2 (9.1%) is buried 
at each monomer interface; formed mainly by a3, the a3- 
a4 loop and a7 in one molecule, and a5 and the P3-(34 
loop in the neighboring molecule (Figure 4D). 



Hydrophobic interactions between the neighboring 
subunits appear to be the main contributors towards the 
overall stabilization of the trimeric conformation. Two 
salt bridges: Glu53A-Argl58 B and Lysl26 B -Glul94 A also 
appear to play important roles (where the A and B sub- 
scripts refer to residues present on adjacent monomers). 
Buried water molecules located at the subunit interfaces 
may also play a role in trimer assembly, via the formation 
of hydrogen bonds. Wat237 links the 081 atom of 
Aspl62 B and the carbonyl oxygen atom of Lysl99A; 
Wat319 links the two carbonyl oxygen atoms of 
Aspl62 B and Ala202 A ; and G1u53a is connected to 
Thrll8 B through Wat226 and Wat231. The hydrophobic 
interactions are mostly concentrated toward the centers of 
the interfaces; whilst the two salt bridges and the hydrogen 
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Figure 5. The LHK-Exo active site and binding site for the 5'-phosphate group of the DNA substrate. A ribbon representation of the LHK-Exo 
structure is colored according to the scheme used in Figure 4A. The Mg 2+ ions and the water molecules are shown as gray and red spheres, 
respectively. (A) The residues involved in Mg 2+ coordination, and those putatively involved in DNA phosphoesterase activity are labeled, with their 
chains shown in stick form. (B) The binding of deoxyadenosine 5'-monophosphate (dAMP) with the phosphate-recognition pocket in LHK-Exo. The 
dAMP molecule is shown in green stick representation and the electron density (2F a -F c contoured at lo, calculated at 2.60 A) is shown in gray. The 
altered conformation (i.e. dAMP bound form) of the Argl2 residue side chain is shown in salmon-colored stick form, and its native conformation is 
shown in gray. (C) Surface representation of the dAMP-bound structure of LHK-Exo reveals a binding pocket for the 5' -phosphate moiety of the 
deoxynucleotide molecule. 



bond network mediated by the ordered water molecules 
are positioned toward the peripheries of the interfaces. 

A tapered central channel enables the passage of a single 
strand of DNA 

A tapered channel ~20A in length passes through the 
centre of the protein trimer (the 'doughnut hole'), whose 
diameter is ~35A at the wide end and ~20A at the 
narrow end (Figure 4C). This central channel appears to 
be more 'open' (i.e. slightly wider) at both ends than the 
corresponding channel present in the A-exonuclease 
trimer, due to the absence of the two (untraced) flexible 
loops. The RecE tetramer forms a similar central channel 
with a length of ~40 A, which is tapered from ~30 A at the 
wide end to ~10A at the narrow end (38). 

The striking similarities in structural organization 
shared between the A-exonuclease trimer, the RecE 
tetramer and the LHK-Exo trimer strongly suggests that 
they share a conserved mode of action. It has previously 
been proposed for A-exonuclease and RecE that the 
dsDNA substrate enters through the wide end of the 
channel, and the non-digested 3'-ended ssDNA exits 
through the narrow end (38,41). For the LHK-Exo 
trimer, the positive electrostatic potential is predominantly 
concentrated at both ends of the central channel. 
However, the central channel does not have a strong 
overall positive electrostatic potential, and the active site 
possesses a strong negative charged surface (Figure 4C). 
Highly similar patterns of electrostatic potential distribu- 
tion are also found within the central channels of the 
A-exonuclease and RecE multimers. This suggests that 
for this group of exonucleases, the binding of the 
5'-ended DNA strand (to be digested) may be primarily 
mediated by specific electrostatic and hydrogen bonding 
interactions. It also indicates that the ingress of the 
dsDNA substrate and the passage of the undigested 
strand out of the channel may be promoted primarily by 
electrostatic interactions. 



In the LHK-Exo trimer structure, the two untraced 
flexible loop regions are positioned at both ends of the 
central channel. Residues 28-32 (TKSGY) that form the 
untraced loop between a2 and a3, extend out from the rim 
of the wide end (entrance) of the channel. Untraced 
residues 55-57 (RFS) lie between oc3 and oc4 on the 
opposite face of the toroid, lining the narrow channel 
'exit'. In the A-exonuclease and RecE proteins, the two 
loops that correspond to these two stretches of untraced 
residues have been proposed to form important contacts 
with the incoming dsDNA substrate (residues 28-32), 
as well as the exiting strand of undigested DNA 
(residues 55-57) (38,41). From a functional perspective, 
it seems likely that these two loops are flexible in order 
to accommodate the translocation of the DNA molecule 
through the centre of the torroid, as the termini of the 
5'-strand are processively hydrolyzed. However, 
sequence alignments reveal that both of these stretches 
of untraced residues show very low levels of homology 
with related exonucleases (Supplementary Figure S5); 
with the loop between a2 and a3 (residues 28-32) being 
absent in some homologs. The Arg55 residue, which is 
located in the small untraced loop proximal to the active 
site, may project toward the centre of the channel exit; 
analogous to Lys76 in 1-exonuclease and Arg858 in 
RecE (38,41). Correspondingly, this residue may form im- 
portant binding contacts with the undigested 3'-strand as 
it translocates through the central channel of the enzyme. 

Coordination of active-site metal ions 

Active-site metal ions were not detected in the native 
crystal structures of either the A.-exonuclease or RecE 
proteins, and their positions were only identified after 
Mn 2+ ions were soaked into the crystals (38,41). 
Notably, in the native structure of LHK-Exo, two 
bound magnesium ions were identified in a negatively 
charged pocket within the active site (Figure 5A). One 
magnesium (II) ion (Mg^ + ) is coordinated to the Os2 
atom of Glull2 (2.3 A); the 082 atom of Aspl02 
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(2.4 A); the carbonyl oxygen atom of Ilel 13 (2.5 A); and to 
two water molecules, Wat383 (2.4 A) and Wat384 (2.3 A). 
The Mg^-binding site is essentially identical to the cor- 
responding metal ion-binding site in A-exonuclease, with 
the exception that two water molecules also contribute to 
the coordination geometry in our structure. It may also be 
noted that Mn 2+ ions are chelated by histidine residues 
located in analogous positions in the active sites of both 
the RecE (His652) (38) and RecB (His956) exonucleases 
(64). The second, putatively more weakly bound magne- 
sium ion (Mgg + ), is located 3.7 A away from Mg 2 ^. A 
corresponding metal ion is absent in the 1-exonuclease 
structure. Mgg + is coordinated by the Osl atom of 
Glull2 (2.7 A); the carbonyl oxygen^ atom of SerlOO 
(2.7 A); the 081 atom of Asp 102 (3.3 A); and two water 
molecules, Wat387 (2.7 A) and Wat388 (3.2 A). In contrast 
to the octahedral coordination geometry typically 
observed in Mg 2+ -bound enzymes, both Mg 2+ ions in 
the LHK-Exo structure have a partial octahedral coord- 
ination, leaving positions available for potential inter- 
actions with DNA substrates or water molecules during 
the catalytic process (see below). 

Based on the sequence analysis of exonuclease 
homologs, it has previously been proposed that the 
highly conserved Glu68 residue (Glu85 in 1-exonuclease) 
is a potential ligand for Mg 2+ ions (8). In our structure, 
Glu68 seems slightly too far from the Mg 2 ^ ion (4. 1 A) to 
act as a direct ligand; however, a water molecule (Wat384) 
forms a bridge between Glu68 and the Mg 2 ^ ion. To probe 
the putative involvement of this residue, the Glu68Ala 
mutant was constructed, expressed and biochemically 
characterized. This mutant protein hydrolyzed linear 
5'-phosphorylated dsDNA at a rate of 2.1 ± 0.5nt/s (pH 
8.0, 7.5 mM MgCl 2 , 25°C), which was 70% lower than 
that of the wild-type LHK-Exo enzyme (7.0 ± 1.8nt/s). 
However, the hydrolytic activities of the Glu68Ala 
mutant increased 1.6-fold (3.4nt/s) when the MgCl 2 con- 
centrations were increased to 20 mM, while those of the 
wild-type enzyme decreased ~35% to 4.7nt/s (data not 
shown). This indicated that the Glu68Ala mutant had a 
reduced affinity for Mg 2+ ions, which could be partially 
compensated for by increasing their concentration in 
solution. The coordination and functional roles of these 
two Mg 2+ ions are discussed further below. 

Recognition of 5'-phosphorylated DNA termini 

It has previously been shown that A-exonuclease digests 
dsDNA substrates that have a phosphate group at their 
5'-termini (5'-P04) significantly more efficiently than 
analogous (dephosphorylated) substrates lacking this 
group (5'-OH) (30,35). Subramanian et al. (36) subse- 
quently proposed that a phosphate ion observed in a 
pocket located 7 A away from the Mg 2+ ion in the 
1-exonuclease crystal structure most likely occupied the 
position of the 5'-phosphorylated terminus of the DNA 
strand to be digested (41). They identified the Arg28, 
Ser35, Serll7 and Glnl57 residues as forming key electro- 
static interactions with this phosphate ion; which are very 
highly conserved amongst A-exonuclease family members 
(Supplementary Figure S5). Mutation of the Arg28 



residue of ^.-exonuclease significantly reduced its digestion 
rate and processivity, and essentially abolished its ability 
to discriminate between 5'-phosphorylated and non- 
phosphorylated dsDNA substrates; demonstrating that 
it played a key role in binding the 5'-phosphate group. 
Contrastingly, an analogous residue is absent in the 
RecE structure (38), and correspondingly the phos- 
phorlyation status of the 5'-strand does not appear to sig- 
nificantly affect the rate of dsDNA hydrolysis (55). 

In the LHK-Exo structure, the putative phosphate- 
binding pocket incorporates the Argl2, Thrl7, Serl9, 
SerlOO and Glnl37 residues. To confirm the proposed 
involvement of these residues in binding the 
5'-phosphorylated DNA terminus, we determined the 
structure of LHK-Exo in complex with dAMP to a reso- 
lution of 2.6 A. The 5'-phosphate group of this 
deoxy-mononucleotide fits comfortably into the binding 
pocket, coordinated tightly by Argl2, Thrl7, Serl9 and 
SerlOO (Figure 5B and C; Supplementary Figure S6). 
Compared with A-exonucleasej the phosphate-binding 
site in LHK-Exo is located 2 A further away from the 
active site. Consequently, Thrl7 rather than Gin 137 
appears to be involved in stabilizing the phosphate 
group. An equivalent residue is present in A-exonuclease 
(Thr33), SXT-Exo (Thr22) and PLU2936 (Thr34), but is 
noticeably absent in other homologous proteins. 
Furthermore, the adenine base moiety of the nucleotide 
does not appear to form any specific interactions with 
LHK-Exo. It interacts loosely with Trp8 and Phe9, 
which may supply a flexible binding environment 
capable of accommodating the four different nucleobase 
moieties present in DNA substrates. 

Notably, Argl2 undergoes a significant conformational 
change upon dAMP binding (Figure 5B); adopting an 
orientation consistent with that of Arg28, which binds 
the phosphate ion in the A-exonuclease structure. To de- 
termine its putative involvement in binding the terminal 
nucleotide residue of the DNA chain to be hydrolyzed, 
the exonuclease activities of the Argl2Ala mutant 
were investigated. The mutant protein digested 
5'-phosphorylated linear (5'-P0 4 ) dsDNA substrates at a 
rate of 1.0 ± 0.3nt/s, which is 7-fold lower than that of 
wild-type LHK-Exo. However, it digested the analogous 
5'-dephosphorylated (5'-OH) dsDNA substrates at an 
essentially identical rate (1.1 ± 0.3 nt/s) (Table 2). 
Consistent with the proposed role of Arg28 in 
X-exonuclease (36), these results indicate that the corres- 
ponding arginine residue in LHK-Exo plays a pivotal role 
in discriminating between 5'-phosphorylated and 
non-phosphorylated forms of DNA. 

Structure of the LHK-Exo protein complexed with a 
ssDNA oligomer reveals a binding channel for the 
hydrolyzed DNA strand 

The A-exonuclease and RecE exonuclease proteins have 
the ability to distinguish and separate the two strands of 
the DNA duplex, and to specifically digest the 5'-ended 
strand in a highly processive manner (30,36,54,55). 
In order to better understand the mechanistic basis 
for 5'-stand recognition and binding, we soaked crystals 
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Figure 6. Structure of a ssDNA molecule bound within the active site of LHK-Exo. (A) The structure of LHK-Exo complexed with 
5'-phosphorylated pentamer of oligothymidine [5'-P0 4 -(dT) 5 ]. The oligonucleotide is shown in yellow-colored stick form and is covered by a 
2F a -F c electron density map contoured at la (calculated at 2.80 A). Bound Mg + ions are shown as gray spheres, and the residues that interact 
with the ssDNA molecule are labeled and shown in stick form. dAMP ligand (green) is superposed on the ssDNA molecule and the 5'-phosphate 
groups of dAMP and ssDNA bind in identical positions. (B) After mapping the electrostatic potential onto the protein surface, four residues of the 
ssDNA ligand (green) can be seen to fit comfortably into a binding channel within one of the LHK-Exo monomers. The 5'- and 3'-termini of the 
complexed ssDNA molecule are indicated. The trimer is shown with the wide-end of the tapered central channel facing the front, i.e. DNA enters 
from the front face. (C and D) Surface representations of the native structures of A-exonuclease (PDB code: 1AVQ) and the HS_1420 exonuclease 
from Haemophilus somnus 129PT (PDB code: 3K93), respectively. Both exonucleases contain similar channels (one is indicated with a yellow arrow) 
that may accommodate the 5' -end of the DNA strand to be hydrolyzed; with a more 'closed' channel observable for A.-exonuclease (C). Both protein 
trimers are shown orientated in a manner analogous to that of LHK-Exo in B, revealing an equivalent positioning of the ssDNA-binding channels 
adjacent to the central tapered channel. 



of LHK-Exo with buffer containing a synthetic 5-mer of 
oligothymidine that had been enzymatically phos- 
phorylated at the 5'-termini [5'-P0 4 -(dT) 5 ]. We were sub- 
sequently able to solve the structure of LHK-Exo- 
oligonucleotide complex to 2.8 A. We could clearly 
identify four complete nucleotide residues (T1-T4) in 
each monomer (Figure 6A and Supplementary Figure 
S6). The ssDNA chain fits comfortably into a channel 
formed by a4, a6 and the a5-(32 loop (Figure 6B). The 
Tl residue was located in the same site as the dAMP 



5'-mononucleotide was (see above); coordinated tightly 
by Argl2, Thrl7, Serl9 and SerlOO. The 5'-phosphate 
group (of Tl) was positioned identically to the 
5'-phosphate group of dAMP in the protein complex 
(Figure 6A). However, the 3'-OH of the Tl deoxyribose 
ring was located 1.5 A closer to the active site, and was 
hydrogen bonded to Mgg + (3.0 A). The fact that the 
dAMP and Tl oligonucleotide residues occupied the 
same binding site in the LHK-Exo crystal structure, 
suggests that this is the most stable nucleotide binding 
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site. This seems reasonable, as this is the residue that is 
hydrolytically cleaved from the DNA chain; being subse- 
quently released as a 5'-mononucleotide. 

The scissile phosphate between the Tl and T2 nucleo- 
tides was stabilized by the two Mg 2+ ions via the Ol atom. 
In addition, the Ol atom of the phosphate moiety between 
12 and T3 formed a 2.9 A hydrogen bond with the main 
chain amide nitrogen of Cysl 15. The Ol atom of the phos- 
phate moiety between T3 and T4 was stabilized by 
hydrogen bonds to the hydroxyl group of Thrl20 (2.6 A) 
and the main chain amide nitrogen of Asnll7 (3.1 A). 
Notably, no specific interactions appeared to be formed 
between the base moieties of the oligonucleotide and the 
LHK-Exo protein. This would be consistent with the exo- 
nuclease protein exhibiting little or no preference for 
dsDNA substrates with differing nucleotide compositions. 
We could not identify the T5 residue at the 3'-termini of 
the oligonucleotide in the complex structure, suggesting 
that this nucleotide does not form stable binding inter- 
actions with the enzyme. 

Our findings indicate that LHK-Exo recognizes and 
tightly binds at least four consecutive nucleotides at the 
5'-termini of the DNA strand to be digested (T1-T4), via 
the sugar-phosphate backbone. This stretch of residues at 
the 5'-termini is most probably separated from the com- 
plementary residues of the (undigested) DNA strand. 
However, it is difficult to extrapolate our results to 
estimate the degree to which the B-form DNA duplex is 
unwound by enzyme, or precisely how many base pairs are 
separated. Notably, our X-ray data is consistent with the 
findings of Dahlroth et al. (63), who modeled an oligo- 
nucleotide into the active site of the SOX exonuclease 
crystal structure. They predicted that SOX was also 
likely to accommodate 4nt. 

It should be noted that the soaking experiments did not 
lead to ssDNA molecule cleavage in our complex struc- 
ture, which was similarly observed for the EcoRV endo- 
nuclease (65). While LHK-Exo possessed notable ssDNA 
exonuclease activities in the presence of 7.5 mM Mg 2+ ions 
(in solution phase at pH 8.0, 37°C), there was negligible 
activity when Mg 2+ ions were omitted (Supplementary 
Figure S2). The crystal soaking experiments were con- 
ducted at high salt concentrations, at 20°C, in the 
absence of free Mg 2+ ions, which may have facilitated 
the retention of an intact tetranucleotide molecule within 
the active site. In addition, some quaternary structure 
changes may occur during the catalysis process that are 
in conflict with the crystal packing forces, which may be 
responsible for the prevention of ssDNA cleavage. It may 
also be the case that the position of the tetranucleotide 
molecule within the active site corresponds to a protected 
region of the substrate DNA as it is processed by 
LHK-Exo, analogous to a proposed ~ 1 1-bp protected 
region of DNA in A-exonuclease (41), although further 
experiments are needed to confirm this. 

A similar ssDNA-binding channel exists in 
1-exonuclease and the HS_1420 phage-related exonuclease 
from Haemophilus somnus 129PT, indicating that they 
may share a common binding mechanism for the 
5'-DNA strand (Figure 6C and D). However, the corres- 
ponding ssDNA-binding channel of the 1-exonuclease 



trimer appears to be slightly more 'closed' at the 
5'-phosphate recognition site, due to the presence of an 
extra a-helix and 3io-helix located at its N-terminus 
(Figure 6B-D). 

We further noted an additional binding pattern in the 
structure of the oligonucleotide-protein complex. The 
three central phosphodiester groups of the DNA chain 
(T1-T2, T2-T3 and T3-T4) are only coordinated to the 
protein via their Ol atoms, leaving their 02 atoms 
orientated toward the same inner-surface of the channel 
(Supplementary Figure S7). This surface is composed of 
Lysll4, Tyrl34 and Glnl37; three residues that are very 
highly conserved among X-exonuclease family members 
(Supplementary Figure S5). The distances between the 
three 02 atoms and the respective polar side-chain 
groups of these residues are maintained within 4.5 A. 
We speculated these three residues may play important 
roles in binding and orienting the 5'-DNA strand dur- 
ing its step-wise hydrolysis. To probe the role of these 
universally conserved residues, we prepared the 
Lysll4Ala, Tyrl34Ala, Tyrl34Phe, Glnl37Ala and 
Glnl37Glu mutants and characterized their activities. 
The Lysll4Ala and Tyrl34Ala mutants digested 
5'-phosphorylated dsDNA with rates of 1.0 and 1.7nt/s, 
respectively; which was roughly 15% and 25% that of the 
wild-type enzyme (Table 2). However, the digestion rate of 
the Tyrl34Phe mutant was 4.0nt/s, which was ~60% that 
of wild-type LHK-Exo protein. The processivity of diges- 
tion mediated by these three mutants were all roughly 
similar (~1100nt hydrolyzed per binding event), which 
is ~50% that of the wild-type enzyme. This data 
indicate that neither of these individual residues plays 
vital roles in DNA hydrolysis or translocation. However, 
the fact that the Tyrl34Phe mutant was significantly more 
active than the Tyrl34Ala mutant suggests that the 
presence of the benzyl side-chain moiety (compared to a 
methyl group) is beneficial for the overall protein archi- 
tecture, or at least plays an ancillary role in substrate 
binding or catalysis. The dsDNA digestion rate of the 
Glnl37Ala mutant was 4-fold lower than the wild-type 
(1.8 ± 0.6nt/s), while the Glnl37Glu mutant was 23-fold 
less active. This ~6-fold difference in catalytic activities 
between these two mutant forms most likely results from 
considerable electrostatic repulsion between the negatively 
charged pendant carboxylate group and the phosphate of 
the DNA substrate. Notably, it has been proposed that 
the corresponding glutamine residue in RecE (Gln781), is 
hydrogen bonded to Lys761; thereby modulating the 
orientation or catalytic activity of this highly conserved 
lysine residue (see below) (38). 

Proposed two metal ion catalytic mechanism for 
LHK-Exo 

Although the crystal structure of ^.-exonuclease was solved 
more than a decade ago, its precise catalytic mechanism 
remains to be established. Our crystal structure of the 
LHK-Exo protein in complex with a ssDNA molecule 
and Mg 2+ ions gives a highly informative snapshot of 
the active site of this exonuclease enzyme. In particular, 
the relative positioning of the two Mg 2+ ions suggests that 
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Figure 7. Conservation of catalytic site architecture between the LHK-Exo DNA exonuclease and the Bgll DNA endonuclease. (A) Binding inter- 
actions formed between the scissile phosphodiester group (blue arrow) of the oligonucleotide and the two Mg 2+ ions and active site residues in 
LHK-Exo. Two thymidine residues of the bound DNA molecule (Tl, T2) are shown in yellow-colored stick form. The two complexed Mg 2+ ions 
(gray spheres) and LHK-Exo active site residues (pale cyan-colored stick form) are labeled. Mg A + may contribute to the deprotonation and sub- 
sequent stabilization of the nucleophilic water molecule. Mgj^ could stabilize the scissile P-O bond and facilitate the departure of the ribose 3' 
hydroxyl moiety. Lysll4 is postulated to act as a conserved general base catalyst. (B) Architecture of the active site of the Bgll restriction endo- 
nuclease with two complexed Ca 2+ ions (PDB code: 1DMU). Adenosine and thymidine nucleotides of the bound DNA molecule are shown in pale 
green-colored stick form. Active site residues are shown in gray-colored stick form and the bound Ca + ions are shown as gray spheres. Lysl44 
has been proposed to act as a conserved general base catalyst, in a 'two-metal' catalytic mechanism. The two water molecules that putatively act as 
the nucleophile and general acid are colored red. The blue arrow indicates the scissile P-O bond. 



this protein may utilize a two-metal ion catalytic mechan- 
ism for DNA hydrolysis (Figure 7A). This has previously 
been proposed to occur in a variety of polymerases (66,67) 
and nucleases (68-72), but within diverse catalytic scen- 
arios. The two bound Mg 2+ ions are located approxi- 
mately 4 A away from each other, parallel to the scissile 
bond. Mg A + is tightly coordinated by Glull2, Aspl02, 
Ilel 13 and the scissile phosphate; but Mg| + is more 
weakly bound by Glull2, SerlOO, Asp 102 and the 
scissile P-O. In the presence of DNA, the two Mg 2+ ions 
are jointly coordinated by the side-chain carboxylate 
groups of two residues that are highly conserved within 
this protein family: Asp 102 and Glull2 (Supplementary 
Figure S5), as well as the 01 atom of the scissile phos- 
phate. It must be noted however, that neither of these 
magnesium ions have a fully occupied octahedral coord- 
ination in our structure. It is possible that water molecules 
crucially involved in the catalytic process may be missing 
due to the modest resolution of our data set (discussed 
below). As described previously, Mg A + could contribute 
to the deprotonation and subsequent stabilization of the 
nucleophilic water molecule. Consistent with the classical 
two-metal ion mechanism for a range of polymerases and 
nucleases, including A-exonuclease, Mg| + may stabilize the 
scissile P-O bond and neutralize the negative charge 
transferred to the 3'-hydroxyl group of the terminal 
nucleotide-5'-phosphate moiety, facilitating its departure 
(73). In addition, both metal cations may help to neutral- 
ize the negative charge associated with the pentavalent 
phosphate moiety transiently generated during the attack 
of the nucleophilic water molecule. 



In the proposed two metal ion catalytic mechanism, 
the highly conserved Lysll4 (equivalent to Lysl31 in 
1-exonuclease, Lys761 in RecE, Lysl44 in Bgll) may 
function as a conserved general base for the activation 
of the nucleophilic water molecule, which has previously 
been suggested (38,41). Mg A + and the (protonated) 
e-amino group of Lysll4 may stabilize the negatively 
charged pentacoordinate phosphate intermediate. 
Substituting Lysll4 for an alanine resulted in a ~85% 
loss in activity compared with wild-type LHK-Exo 
(1.0 ± 0.3nt/s compared to 7.0 ± 1.8nt/s; Table 2). Its di- 
gestion processivity was similarly reduced by ~75% 
(655 ± 45 nt compared to 2368 ± 72 nt digested per 
binding event). This indicates that while this highly 
conserved lysine residue may play an important role in 
facilitating phosphodiester bond hydrolysis, its individual 
functions are not essential. 

We further compared the DNA-bound structure with 
that of the Bgll restriction endonuclease in complex with 
a specific substrate DNA model and Ca 2+ ions (PDB code: 
1DMU) (74), in order to gage the similarity between 
LHK-Exo and a representative type II restriction 
enzyme that employs a two metal ion catalytic mechanism 
(Figure 7A and B). Two crucial water molecules bound in 
the active site of Bgll appear be involved in the catalytic 
process: the water molecule hydrogen bonded to Lysl44 
and metal I (Ca A + ) performs an in-line attack of the scissile 
phosphate; the other water molecule bound to metal II 
(Cag + ) acts as a general acid, protonating the leaving 
group. Unfortunately, it is not possible to identify any 
functionally equivalent water molecules in our ssDNA 



Nucleic Acids Research, 2011, Vol. 39, No. 22 9817 



bound structure of LHK-Exo due to the modest reso- 
lution and slightly smeared electron density. Furthermore, 
we are not able to identify the precise mechanism by which 
LHK-Exo promotes the formation and stabilization of 
the pentacoordinate phosphate intermediate. However, 
the analogous composition and orientation of the active 
site residues and metal ions within the Bgll and LHK-Exo 
crystal structures supports the operation of an equivalent 
two-metal ion catalytic mechanism. A more precise iden- 
tification and positioning of the crucial residues and water 
molecules within the LHK-Exo active site will require 
higher resolution structures in complex with ssDNA or 
even (partially) dsDNA model substrates. 

The biophysical, biochemical and structural data pre- 
sented here suggest that a considerable number of residues 
within the central channel of the LHK-Exo trimer are 
actively involved in binding, orienting and translocating 
the DNA substrate; as well as playing a direct catalytic 
role in the iterative hydrolysis of nucleotides from the 
terminus of the 5'-strand. Our results are consistent with 
the mutational analysis of the RecE exonuclease, where 
the precise roles of the individual active site residues 
proved similarly difficult to unambiguously establish, 
and there appears to be a significant amount of catalytic 
redundancy (38). While many key mechanistic points 
remain to be established, the structures of the 
mononucleotide and oligonucleotide-bound complexes of 
LHK-Exo described here are the first such structures 
reported for a member of this notable class of exonucle- 
ases. They provide structural evidence consistent with the 
operation of a two metal ion catalytic mechanism for this 
protein, and offer significant additional insight into the 
catalytic mechanisms of related nucleases of phage/viral 
origin. 

ADDENDUM 

While revising our manuscript, we became aware of an 
'early edition' article reporting two DNA-bound 
complexes of A-exonuclease (75). The authors used 
12-mer dsDNA (Form 1) or a 14-mer:12-mer DNA 
duplex (containing a 2 nt 5'-overhang; Form 2) for 
co-crystallization with the wild-type and catalytically 
inactive K131A mutant forms of 1-exo nuclease, respect- 
ively. Interestingly, the position of the ssDNA in our 
LHK-Exo complex structure overlaps precisely with the 
5'-end of the 14-mer DNA strand in the Form 2 structure, 
including the 2-nt overhang. The X-exonuclease: dsDNA 
structures reported by Zhang and colleagues are highly 
consistent with our LHK-Exo:ssDNA structure reported 
here. Taken together, our studies reveal that these two 
exonuclease proteins share an equivalent arrangement of 
metal cofactor and substrate bonding interactions, and 
employ a functionally identical two metal ion catalytic 
mechanism. 
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